Dataset statistics
| Number of variables | 21 |
|---|---|
| Number of observations | 23547 |
| Missing cells | 66918 |
| Missing cells (%) | 13.5% |
| Duplicate rows | 1 |
| Duplicate rows (%) | < 0.1% |
| Total size in memory | 3.8 MiB |
| Average record size in memory | 168.0 B |
Variable types
| Categorical | 8 |
|---|---|
| Numeric | 13 |
| Dataset has 1 (< 0.1%) duplicate rows | Duplicates |
Suburb has a high cardinality: 336 distinct values | High cardinality |
Address has a high cardinality: 23108 distinct values | High cardinality |
SellerG has a high cardinality: 330 distinct values | High cardinality |
Date has a high cardinality: 58 distinct values | High cardinality |
Rooms is highly correlated with Bedroom2 | High correlation |
Bedroom2 is highly correlated with Rooms | High correlation |
Price has 5151 (21.9%) missing values | Missing |
Bedroom2 has 4481 (19.0%) missing values | Missing |
Bathroom has 4484 (19.0%) missing values | Missing |
Car has 4626 (19.6%) missing values | Missing |
Landsize has 6137 (26.1%) missing values | Missing |
BuildingArea has 13529 (57.5%) missing values | Missing |
YearBuilt has 12007 (51.0%) missing values | Missing |
CouncilArea has 7891 (33.5%) missing values | Missing |
Lattitude has 4304 (18.3%) missing values | Missing |
Longtitude has 4304 (18.3%) missing values | Missing |
Landsize is highly skewed (γ1 = 106.0673053) | Skewed |
BuildingArea is highly skewed (γ1 = 88.59840308) | Skewed |
Address is uniformly distributed | Uniform |
Car has 1385 (5.9%) zeros | Zeros |
Landsize has 2437 (10.3%) zeros | Zeros |
Reproduction
| Analysis started | 2021-04-17 04:53:29.110028 |
|---|---|
| Analysis finished | 2021-04-17 04:53:54.088753 |
| Duration | 24.98 seconds |
| Software version | pandas-profiling v2.11.0 |
| Download configuration | config.yaml |
| Distinct | 336 |
|---|---|
| Distinct (%) | 1.4% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 184.1 KiB |
| Reservoir | 629 |
|---|---|
| Bentleigh East | 429 |
| Richmond | 416 |
| Glen Iris | 378 |
| Kew | 357 |
| Other values (331) |
Length
| Max length | 18 |
|---|---|
| Median length | 9 |
| Mean length | 9.783029685 |
| Min length | 3 |
Characters and Unicode
| Total characters | 230361 |
|---|---|
| Distinct characters | 49 |
| Distinct categories | 3 ? |
| Distinct scripts | 2 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 23 ? |
|---|---|
| Unique (%) | 0.1% |
Sample
| 1st row | Abbotsford |
|---|---|
| 2nd row | Abbotsford |
| 3rd row | Abbotsford |
| 4th row | Abbotsford |
| 5th row | Abbotsford |
| Value | Count | Frequency (%) |
| Reservoir | 629 | 2.7% |
| Bentleigh East | 429 | 1.8% |
| Richmond | 416 | 1.8% |
| Glen Iris | 378 | 1.6% |
| Kew | 357 | 1.5% |
| Preston | 357 | 1.5% |
| Brighton | 348 | 1.5% |
| South Yarra | 331 | 1.4% |
| Brunswick | 330 | 1.4% |
| Hawthorn | 318 | 1.4% |
| Other values (326) | 19654 |
| Value | Count | Frequency (%) |
| east | 1964 | 6.0% |
| north | 1277 | 3.9% |
| south | 913 | 2.8% |
| melbourne | 789 | 2.4% |
| west | 786 | 2.4% |
| bentleigh | 663 | 2.0% |
| brunswick | 656 | 2.0% |
| brighton | 641 | 2.0% |
| reservoir | 629 | 1.9% |
| balwyn | 548 | 1.7% |
| Other values (283) | 23600 |
Most occurring characters
| Value | Count | Frequency (%) |
| e | 20686 | 9.0% |
| r | 19501 | 8.5% |
| o | 19438 | 8.4% |
| n | 16683 | 7.2% |
| a | 15593 | 6.8% |
| t | 14219 | 6.2% |
| l | 12814 | 5.6% |
| i | 10763 | 4.7% |
| s | 10712 | 4.7% |
| 8919 | 3.9% | |
| Other values (39) | 81033 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 188923 | |
| Uppercase Letter | 32519 | 14.1% |
| Space Separator | 8919 | 3.9% |
Most frequent character per category
| Value | Count | Frequency (%) |
| e | 20686 | |
| r | 19501 | |
| o | 19438 | |
| n | 16683 | |
| a | 15593 | 8.3% |
| t | 14219 | 7.5% |
| l | 12814 | 6.8% |
| i | 10763 | 5.7% |
| s | 10712 | 5.7% |
| h | 7734 | 4.1% |
| Other values (15) | 40780 |
| Value | Count | Frequency (%) |
| B | 3506 | 10.8% |
| E | 2884 | 8.9% |
| M | 2778 | 8.5% |
| S | 2453 | 7.5% |
| H | 2408 | 7.4% |
| C | 2275 | 7.0% |
| P | 2055 | 6.3% |
| N | 1974 | 6.1% |
| W | 1661 | 5.1% |
| A | 1649 | 5.1% |
| Other values (13) | 8876 |
| Value | Count | Frequency (%) |
| 8919 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 221442 | |
| Common | 8919 | 3.9% |
Most frequent character per script
| Value | Count | Frequency (%) |
| e | 20686 | 9.3% |
| r | 19501 | 8.8% |
| o | 19438 | 8.8% |
| n | 16683 | 7.5% |
| a | 15593 | 7.0% |
| t | 14219 | 6.4% |
| l | 12814 | 5.8% |
| i | 10763 | 4.9% |
| s | 10712 | 4.8% |
| h | 7734 | 3.5% |
| Other values (38) | 73299 |
| Value | Count | Frequency (%) |
| 8919 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 230361 |
Most frequent character per block
| Value | Count | Frequency (%) |
| e | 20686 | 9.0% |
| r | 19501 | 8.5% |
| o | 19438 | 8.4% |
| n | 16683 | 7.2% |
| a | 15593 | 6.8% |
| t | 14219 | 6.2% |
| l | 12814 | 5.6% |
| i | 10763 | 4.7% |
| s | 10712 | 4.7% |
| 8919 | 3.9% | |
| Other values (39) | 81033 |
| Distinct | 23108 |
|---|---|
| Distinct (%) | 98.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 184.1 KiB |
| 5 Charles St | 5 |
|---|---|
| 23 Cromwell St | 3 |
| 1 Daisy St | 3 |
| 7 Churchill Av | 3 |
| 36 Aberfeldie St | 3 |
| Other values (23103) |
Length
| Max length | 27 |
|---|---|
| Median length | 13 |
| Mean length | 13.60394105 |
| Min length | 8 |
Characters and Unicode
| Total characters | 320332 |
|---|---|
| Distinct characters | 64 |
| Distinct categories | 5 ? |
| Distinct scripts | 2 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 22692 ? |
|---|---|
| Unique (%) | 96.4% |
Sample
| 1st row | 68 Studley St |
|---|---|
| 2nd row | 85 Turner St |
| 3rd row | 25 Bloomburg St |
| 4th row | 18/659 Victoria St |
| 5th row | 5 Charles St |
| Value | Count | Frequency (%) |
| 5 Charles St | 5 | < 0.1% |
| 23 Cromwell St | 3 | < 0.1% |
| 1 Daisy St | 3 | < 0.1% |
| 7 Churchill Av | 3 | < 0.1% |
| 36 Aberfeldie St | 3 | < 0.1% |
| 38 Lily St | 3 | < 0.1% |
| 2 Bruce St | 3 | < 0.1% |
| 53 William St | 3 | < 0.1% |
| 1/1 Clarendon St | 3 | < 0.1% |
| 14 Arthur St | 3 | < 0.1% |
| Other values (23098) | 23515 |
| Value | Count | Frequency (%) |
| st | 12272 | 17.3% |
| rd | 4565 | 6.4% |
| av | 2260 | 3.2% |
| ct | 965 | 1.4% |
| cr | 719 | 1.0% |
| dr | 692 | 1.0% |
| gr | 481 | 0.7% |
| 3 | 445 | 0.6% |
| 5 | 428 | 0.6% |
| 4 | 422 | 0.6% |
| Other values (10156) | 47735 |
Most occurring characters
| Value | Count | Frequency (%) |
| 47437 | 14.8% | |
| t | 20425 | 6.4% |
| e | 16701 | 5.2% |
| r | 15173 | 4.7% |
| a | 14749 | 4.6% |
| S | 13919 | 4.3% |
| n | 12834 | 4.0% |
| 1 | 12474 | 3.9% |
| o | 11622 | 3.6% |
| l | 11030 | 3.4% |
| Other values (54) | 143968 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 161156 | |
| Decimal Number | 56174 | 17.5% |
| Uppercase Letter | 48256 | 15.1% |
| Space Separator | 47437 | 14.8% |
| Other Punctuation | 7309 | 2.3% |
Most frequent character per category
| Value | Count | Frequency (%) |
| S | 13919 | |
| R | 5811 | |
| C | 4025 | 8.3% |
| A | 4009 | 8.3% |
| B | 2450 | 5.1% |
| M | 2206 | 4.6% |
| P | 1847 | 3.8% |
| D | 1831 | 3.8% |
| G | 1759 | 3.6% |
| W | 1645 | 3.4% |
| Other values (16) | 8754 |
| Value | Count | Frequency (%) |
| t | 20425 | |
| e | 16701 | |
| r | 15173 | |
| a | 14749 | |
| n | 12834 | 8.0% |
| o | 11622 | 7.2% |
| l | 11030 | 6.8% |
| d | 9990 | 6.2% |
| i | 8838 | 5.5% |
| s | 5941 | 3.7% |
| Other values (16) | 33853 |
| Value | Count | Frequency (%) |
| 1 | 12474 | |
| 2 | 8570 | |
| 3 | 6697 | |
| 4 | 5403 | |
| 5 | 4694 | 8.4% |
| 6 | 4164 | 7.4% |
| 7 | 3782 | 6.7% |
| 0 | 3757 | 6.7% |
| 8 | 3522 | 6.3% |
| 9 | 3111 | 5.5% |
| Value | Count | Frequency (%) |
| 47437 |
| Value | Count | Frequency (%) |
| / | 7309 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 209412 | |
| Common | 110920 |
Most frequent character per script
| Value | Count | Frequency (%) |
| t | 20425 | 9.8% |
| e | 16701 | 8.0% |
| r | 15173 | 7.2% |
| a | 14749 | 7.0% |
| S | 13919 | 6.6% |
| n | 12834 | 6.1% |
| o | 11622 | 5.5% |
| l | 11030 | 5.3% |
| d | 9990 | 4.8% |
| i | 8838 | 4.2% |
| Other values (42) | 74131 |
| Value | Count | Frequency (%) |
| 47437 | ||
| 1 | 12474 | 11.2% |
| 2 | 8570 | 7.7% |
| / | 7309 | 6.6% |
| 3 | 6697 | 6.0% |
| 4 | 5403 | 4.9% |
| 5 | 4694 | 4.2% |
| 6 | 4164 | 3.8% |
| 7 | 3782 | 3.4% |
| 0 | 3757 | 3.4% |
| Other values (2) | 6633 | 6.0% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 320332 |
Most frequent character per block
| Value | Count | Frequency (%) |
| 47437 | 14.8% | |
| t | 20425 | 6.4% |
| e | 16701 | 5.2% |
| r | 15173 | 4.7% |
| a | 14749 | 4.6% |
| S | 13919 | 4.3% |
| n | 12834 | 4.0% |
| 1 | 12474 | 3.9% |
| o | 11622 | 3.6% |
| l | 11030 | 3.4% |
| Other values (54) | 143968 |
| Distinct | 11 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 2.976047904 |
|---|---|
| Minimum | 1 |
| Maximum | 12 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Memory size | 184.1 KiB |
Quantile statistics
| Minimum | 1 |
|---|---|
| 5-th percentile | 2 |
| Q1 | 2 |
| median | 3 |
| Q3 | 4 |
| 95-th percentile | 5 |
| Maximum | 12 |
| Range | 11 |
| Interquartile range (IQR) | 2 |
Descriptive statistics
| Standard deviation | 0.9745005716 |
|---|---|
| Coefficient of variation (CV) | 0.3274478782 |
| Kurtosis | 1.820956665 |
| Mean | 2.976047904 |
| Median Absolute Deviation (MAD) | 1 |
| Skewness | 0.502241333 |
| Sum | 70077 |
| Variance | 0.9496513641 |
| Monotocity | Not monotonic |
| Value | Count | Frequency (%) |
| 3 | 10076 | |
| 2 | 6115 | |
| 4 | 4953 | |
| 1 | 1119 | 4.8% |
| 5 | 1107 | 4.7% |
| 6 | 133 | 0.6% |
| 7 | 19 | 0.1% |
| 8 | 14 | 0.1% |
| 10 | 5 | < 0.1% |
| 9 | 4 | < 0.1% |
| Value | Count | Frequency (%) |
| 1 | 1119 | 4.8% |
| 2 | 6115 | |
| 3 | 10076 | |
| 4 | 4953 | |
| 5 | 1107 | 4.7% |
| Value | Count | Frequency (%) |
| 12 | 2 | < 0.1% |
| 10 | 5 | < 0.1% |
| 9 | 4 | < 0.1% |
| 8 | 14 | |
| 7 | 19 |
Type
Categorical
| Distinct | 3 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 184.1 KiB |
| h | |
|---|---|
| u | |
| t |
Length
| Max length | 1 |
|---|---|
| Median length | 1 |
| Mean length | 1 |
| Min length | 1 |
Characters and Unicode
| Total characters | 23547 |
|---|---|
| Distinct characters | 3 |
| Distinct categories | 1 ? |
| Distinct scripts | 1 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | h |
|---|---|
| 2nd row | h |
| 3rd row | h |
| 4th row | u |
| 5th row | h |
| Value | Count | Frequency (%) |
| h | 15760 | |
| u | 5280 | 22.4% |
| t | 2507 | 10.6% |
| Value | Count | Frequency (%) |
| h | 15760 | |
| u | 5280 | 22.4% |
| t | 2507 | 10.6% |
Most occurring characters
| Value | Count | Frequency (%) |
| h | 15760 | |
| u | 5280 | 22.4% |
| t | 2507 | 10.6% |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 23547 |
Most frequent character per category
| Value | Count | Frequency (%) |
| h | 15760 | |
| u | 5280 | 22.4% |
| t | 2507 | 10.6% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 23547 |
Most frequent character per script
| Value | Count | Frequency (%) |
| h | 15760 | |
| u | 5280 | 22.4% |
| t | 2507 | 10.6% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 23547 |
Most frequent character per block
| Value | Count | Frequency (%) |
| h | 15760 | |
| u | 5280 | 22.4% |
| t | 2507 | 10.6% |
| Distinct | 2470 |
|---|---|
| Distinct (%) | 13.4% |
| Missing | 5151 |
| Missing (%) | 21.9% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 1056697.461 |
|---|---|
| Minimum | 85000 |
| Maximum | 9000000 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Memory size | 184.1 KiB |
Quantile statistics
| Minimum | 85000 |
|---|---|
| 5-th percentile | 405000 |
| Q1 | 633000 |
| median | 880000 |
| Q3 | 1302000 |
| 95-th percentile | 2255000 |
| Maximum | 9000000 |
| Range | 8915000 |
| Interquartile range (IQR) | 669000 |
Descriptive statistics
| Standard deviation | 641921.6667 |
|---|---|
| Coefficient of variation (CV) | 0.6074791418 |
| Kurtosis | 10.37309885 |
| Mean | 1056697.461 |
| Median Absolute Deviation (MAD) | 306500 |
| Skewness | 2.366672689 |
| Sum | 1.943900649 × 1010 |
| Variance | 4.120634262 × 1011 |
| Monotocity | Not monotonic |
| Value | Count | Frequency (%) |
| 600000 | 156 | 0.7% |
| 1100000 | 151 | 0.6% |
| 650000 | 149 | 0.6% |
| 800000 | 144 | 0.6% |
| 1200000 | 144 | 0.6% |
| 1300000 | 140 | 0.6% |
| 1000000 | 134 | 0.6% |
| 900000 | 123 | 0.5% |
| 500000 | 116 | 0.5% |
| 750000 | 116 | 0.5% |
| Other values (2460) | 17023 | |
| (Missing) | 5151 | 21.9% |
| Value | Count | Frequency (%) |
| 85000 | 1 | |
| 121000 | 1 | |
| 131000 | 1 | |
| 145000 | 2 | |
| 160000 | 1 |
| Value | Count | Frequency (%) |
| 9000000 | 1 | |
| 8000000 | 1 | |
| 7650000 | 1 | |
| 6800000 | 1 | |
| 6500000 | 1 |
Method
Categorical
| Distinct | 9 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 184.1 KiB |
| S | |
|---|---|
| SP | |
| PI | |
| VB | |
| SN | 1041 |
| Other values (4) | 479 |
Length
| Max length | 2 |
|---|---|
| Median length | 1 |
| Mean length | 1.415891621 |
| Min length | 1 |
Characters and Unicode
| Total characters | 33340 |
|---|---|
| Distinct characters | 8 |
| Distinct categories | 1 ? |
| Distinct scripts | 1 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | SS |
|---|---|
| 2nd row | S |
| 3rd row | S |
| 4th row | VB |
| 5th row | SP |
| Value | Count | Frequency (%) |
| S | 13660 | |
| SP | 3366 | 14.3% |
| PI | 3140 | 13.3% |
| VB | 1861 | 7.9% |
| SN | 1041 | 4.4% |
| PN | 209 | 0.9% |
| SA | 154 | 0.7% |
| W | 94 | 0.4% |
| SS | 22 | 0.1% |
| Value | Count | Frequency (%) |
| s | 13660 | |
| sp | 3366 | 14.3% |
| pi | 3140 | 13.3% |
| vb | 1861 | 7.9% |
| sn | 1041 | 4.4% |
| pn | 209 | 0.9% |
| sa | 154 | 0.7% |
| w | 94 | 0.4% |
| ss | 22 | 0.1% |
Most occurring characters
| Value | Count | Frequency (%) |
| S | 18265 | |
| P | 6715 | 20.1% |
| I | 3140 | 9.4% |
| V | 1861 | 5.6% |
| B | 1861 | 5.6% |
| N | 1250 | 3.7% |
| A | 154 | 0.5% |
| W | 94 | 0.3% |
Most occurring categories
| Value | Count | Frequency (%) |
| Uppercase Letter | 33340 |
Most frequent character per category
| Value | Count | Frequency (%) |
| S | 18265 | |
| P | 6715 | 20.1% |
| I | 3140 | 9.4% |
| V | 1861 | 5.6% |
| B | 1861 | 5.6% |
| N | 1250 | 3.7% |
| A | 154 | 0.5% |
| W | 94 | 0.3% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 33340 |
Most frequent character per script
| Value | Count | Frequency (%) |
| S | 18265 | |
| P | 6715 | 20.1% |
| I | 3140 | 9.4% |
| V | 1861 | 5.6% |
| B | 1861 | 5.6% |
| N | 1250 | 3.7% |
| A | 154 | 0.5% |
| W | 94 | 0.3% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 33340 |
Most frequent character per block
| Value | Count | Frequency (%) |
| S | 18265 | |
| P | 6715 | 20.1% |
| I | 3140 | 9.4% |
| V | 1861 | 5.6% |
| B | 1861 | 5.6% |
| N | 1250 | 3.7% |
| A | 154 | 0.5% |
| W | 94 | 0.3% |
| Distinct | 330 |
|---|---|
| Distinct (%) | 1.4% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 184.1 KiB |
| Nelson | |
|---|---|
| Jellis | |
| Barry | |
| hockingstuart | |
| Marshall | |
| Other values (325) |
Length
| Max length | 27 |
|---|---|
| Median length | 6 |
| Mean length | 6.384380176 |
| Min length | 1 |
Characters and Unicode
| Total characters | 150333 |
|---|---|
| Distinct characters | 58 |
| Distinct categories | 4 ? |
| Distinct scripts | 2 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 86 ? |
|---|---|
| Unique (%) | 0.4% |
Sample
| 1st row | Jellis |
|---|---|
| 2nd row | Biggin |
| 3rd row | Biggin |
| 4th row | Rounds |
| 5th row | Biggin |
| Value | Count | Frequency (%) |
| Nelson | 2374 | 10.1% |
| Jellis | 2320 | 9.9% |
| Barry | 1998 | 8.5% |
| hockingstuart | 1943 | 8.3% |
| Marshall | 1474 | 6.3% |
| Ray | 1251 | 5.3% |
| Buxton | 1242 | 5.3% |
| Biggin | 665 | 2.8% |
| Fletchers | 571 | 2.4% |
| Woodards | 510 | 2.2% |
| Other values (320) | 9199 |
| Value | Count | Frequency (%) |
| nelson | 2374 | 10.1% |
| jellis | 2320 | 9.9% |
| barry | 1998 | 8.5% |
| hockingstuart | 1943 | 8.3% |
| marshall | 1474 | 6.3% |
| ray | 1251 | 5.3% |
| buxton | 1242 | 5.3% |
| biggin | 665 | 2.8% |
| fletchers | 571 | 2.4% |
| woodards | 510 | 2.2% |
| Other values (317) | 9199 |
Most occurring characters
| Value | Count | Frequency (%) |
| l | 13679 | 9.1% |
| a | 13061 | 8.7% |
| r | 12158 | 8.1% |
| s | 11803 | 7.9% |
| e | 10765 | 7.2% |
| o | 9423 | 6.3% |
| n | 8665 | 5.8% |
| i | 8360 | 5.6% |
| t | 7064 | 4.7% |
| h | 5197 | 3.5% |
| Other values (48) | 50158 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 125664 | |
| Uppercase Letter | 24175 | 16.1% |
| Other Punctuation | 304 | 0.2% |
| Decimal Number | 190 | 0.1% |
Most frequent character per category
| Value | Count | Frequency (%) |
| B | 4841 | |
| N | 2857 | |
| J | 2764 | |
| M | 2564 | |
| R | 2362 | |
| G | 1111 | 4.6% |
| W | 985 | 4.1% |
| H | 800 | 3.3% |
| F | 689 | 2.9% |
| T | 666 | 2.8% |
| Other values (16) | 4536 |
| Value | Count | Frequency (%) |
| l | 13679 | |
| a | 13061 | |
| r | 12158 | |
| s | 11803 | |
| e | 10765 | |
| o | 9423 | 7.5% |
| n | 8665 | 6.9% |
| i | 8360 | 6.7% |
| t | 7064 | 5.6% |
| h | 5197 | 4.1% |
| Other values (15) | 25489 |
| Value | Count | Frequency (%) |
| ' | 177 | |
| . | 53 | 17.4% |
| & | 48 | 15.8% |
| / | 20 | 6.6% |
| @ | 6 | 2.0% |
| Value | Count | Frequency (%) |
| 2 | 95 | |
| 1 | 95 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 149839 | |
| Common | 494 | 0.3% |
Most frequent character per script
| Value | Count | Frequency (%) |
| l | 13679 | 9.1% |
| a | 13061 | 8.7% |
| r | 12158 | 8.1% |
| s | 11803 | 7.9% |
| e | 10765 | 7.2% |
| o | 9423 | 6.3% |
| n | 8665 | 5.8% |
| i | 8360 | 5.6% |
| t | 7064 | 4.7% |
| h | 5197 | 3.5% |
| Other values (41) | 49664 |
| Value | Count | Frequency (%) |
| ' | 177 | |
| 2 | 95 | |
| 1 | 95 | |
| . | 53 | 10.7% |
| & | 48 | 9.7% |
| / | 20 | 4.0% |
| @ | 6 | 1.2% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 150333 |
Most frequent character per block
| Value | Count | Frequency (%) |
| l | 13679 | 9.1% |
| a | 13061 | 8.7% |
| r | 12158 | 8.1% |
| s | 11803 | 7.9% |
| e | 10765 | 7.2% |
| o | 9423 | 6.3% |
| n | 8665 | 5.8% |
| i | 8360 | 5.6% |
| t | 7064 | 4.7% |
| h | 5197 | 3.5% |
| Other values (48) | 50158 |
| Distinct | 58 |
|---|---|
| Distinct (%) | 0.2% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 184.1 KiB |
| 27-05-2017 | 770 |
|---|---|
| 23-09-2017 | 742 |
| 16-09-2017 | 730 |
| 03-06-2017 | 689 |
| 26-08-2017 | 647 |
| Other values (53) |
Length
| Max length | 10 |
|---|---|
| Median length | 10 |
| Mean length | 10 |
| Min length | 10 |
Characters and Unicode
| Total characters | 235470 |
|---|---|
| Distinct characters | 11 |
| Distinct categories | 2 ? |
| Distinct scripts | 1 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | 03-09-2016 |
|---|---|
| 2nd row | 03-12-2016 |
| 3rd row | 04-02-2016 |
| 4th row | 04-02-2016 |
| 5th row | 04-03-2017 |
| Value | Count | Frequency (%) |
| 27-05-2017 | 770 | 3.3% |
| 23-09-2017 | 742 | 3.2% |
| 16-09-2017 | 730 | 3.1% |
| 03-06-2017 | 689 | 2.9% |
| 26-08-2017 | 647 | 2.7% |
| 17-06-2017 | 637 | 2.7% |
| 24-06-2017 | 607 | 2.6% |
| 09-09-2017 | 598 | 2.5% |
| 27-11-2016 | 575 | 2.4% |
| 03-09-2017 | 567 | 2.4% |
| Other values (48) | 16985 |
| Value | Count | Frequency (%) |
| 27-05-2017 | 770 | 3.3% |
| 23-09-2017 | 742 | 3.2% |
| 16-09-2017 | 730 | 3.1% |
| 03-06-2017 | 689 | 2.9% |
| 26-08-2017 | 647 | 2.7% |
| 17-06-2017 | 637 | 2.7% |
| 24-06-2017 | 607 | 2.6% |
| 09-09-2017 | 598 | 2.5% |
| 27-11-2016 | 575 | 2.4% |
| 03-09-2017 | 567 | 2.4% |
| Other values (48) | 16985 |
Most occurring characters
| Value | Count | Frequency (%) |
| 0 | 52515 | |
| - | 47094 | |
| 1 | 37925 | |
| 2 | 36115 | |
| 7 | 19655 | 8.3% |
| 6 | 16089 | 6.8% |
| 9 | 6487 | 2.8% |
| 8 | 5955 | 2.5% |
| 3 | 5007 | 2.1% |
| 5 | 4858 | 2.1% |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 188376 | |
| Dash Punctuation | 47094 | 20.0% |
Most frequent character per category
| Value | Count | Frequency (%) |
| 0 | 52515 | |
| 1 | 37925 | |
| 2 | 36115 | |
| 7 | 19655 | 10.4% |
| 6 | 16089 | 8.5% |
| 9 | 6487 | 3.4% |
| 8 | 5955 | 3.2% |
| 3 | 5007 | 2.7% |
| 5 | 4858 | 2.6% |
| 4 | 3770 | 2.0% |
| Value | Count | Frequency (%) |
| - | 47094 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 235470 |
Most frequent character per script
| Value | Count | Frequency (%) |
| 0 | 52515 | |
| - | 47094 | |
| 1 | 37925 | |
| 2 | 36115 | |
| 7 | 19655 | 8.3% |
| 6 | 16089 | 6.8% |
| 9 | 6487 | 2.8% |
| 8 | 5955 | 2.5% |
| 3 | 5007 | 2.1% |
| 5 | 4858 | 2.1% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 235470 |
Most frequent character per block
| Value | Count | Frequency (%) |
| 0 | 52515 | |
| - | 47094 | |
| 1 | 37925 | |
| 2 | 36115 | |
| 7 | 19655 | 8.3% |
| 6 | 16089 | 6.8% |
| 9 | 6487 | 2.8% |
| 8 | 5955 | 2.5% |
| 3 | 5007 | 2.1% |
| 5 | 4858 | 2.1% |
Distance
Real number (ℝ≥0)
| Distinct | 211 |
|---|---|
| Distinct (%) | 0.9% |
| Missing | 1 |
| Missing (%) | < 0.1% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 10.30651491 |
|---|---|
| Minimum | 0 |
| Maximum | 48.1 |
| Zeros | 32 |
| Zeros (%) | 0.1% |
| Memory size | 184.1 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 2.6 |
| Q1 | 6.2 |
| median | 9.5 |
| Q3 | 13 |
| 95-th percentile | 21.1 |
| Maximum | 48.1 |
| Range | 48.1 |
| Interquartile range (IQR) | 6.8 |
Descriptive statistics
| Standard deviation | 6.016318012 |
|---|---|
| Coefficient of variation (CV) | 0.5837393208 |
| Kurtosis | 5.16715539 |
| Mean | 10.30651491 |
| Median Absolute Deviation (MAD) | 3.5 |
| Skewness | 1.674115852 |
| Sum | 242677.2 |
| Variance | 36.19608242 |
| Monotocity | Not monotonic |
| Value | Count | Frequency (%) |
| 11.2 | 1272 | 5.4% |
| 9.2 | 665 | 2.8% |
| 7.8 | 570 | 2.4% |
| 13.9 | 499 | 2.1% |
| 4.6 | 473 | 2.0% |
| 13 | 427 | 1.8% |
| 13.8 | 425 | 1.8% |
| 10.5 | 395 | 1.7% |
| 5.2 | 383 | 1.6% |
| 11.4 | 382 | 1.6% |
| Other values (201) | 18055 |
| Value | Count | Frequency (%) |
| 0 | 32 | |
| 0.7 | 16 | 0.1% |
| 1.2 | 47 | |
| 1.3 | 15 | 0.1% |
| 1.4 | 4 | < 0.1% |
| Value | Count | Frequency (%) |
| 48.1 | 4 | < 0.1% |
| 47.4 | 3 | < 0.1% |
| 47.3 | 9 | |
| 45.9 | 14 | |
| 45.2 | 1 | < 0.1% |
Postcode
Real number (ℝ≥0)
| Distinct | 206 |
|---|---|
| Distinct (%) | 0.9% |
| Missing | 1 |
| Missing (%) | < 0.1% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 3109.782893 |
|---|---|
| Minimum | 3000 |
| Maximum | 3978 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Memory size | 184.1 KiB |
Quantile statistics
| Minimum | 3000 |
|---|---|
| 5-th percentile | 3013 |
| Q1 | 3047 |
| median | 3101 |
| Q3 | 3150 |
| 95-th percentile | 3204 |
| Maximum | 3978 |
| Range | 978 |
| Interquartile range (IQR) | 103 |
Descriptive statistics
| Standard deviation | 94.52218971 |
|---|---|
| Coefficient of variation (CV) | 0.03039510891 |
| Kurtosis | 28.59355041 |
| Mean | 3109.782893 |
| Median Absolute Deviation (MAD) | 50 |
| Skewness | 4.163565827 |
| Sum | 73222948 |
| Variance | 8934.444347 |
| Monotocity | Not monotonic |
| Value | Count | Frequency (%) |
| 3073 | 629 | 2.7% |
| 3046 | 490 | 2.1% |
| 3020 | 460 | 2.0% |
| 3121 | 457 | 1.9% |
| 3165 | 429 | 1.8% |
| 3058 | 414 | 1.8% |
| 3163 | 411 | 1.7% |
| 3040 | 404 | 1.7% |
| 3032 | 380 | 1.6% |
| 3204 | 379 | 1.6% |
| Other values (196) | 19093 |
| Value | Count | Frequency (%) |
| 3000 | 159 | |
| 3002 | 44 | 0.2% |
| 3003 | 53 | 0.2% |
| 3006 | 63 | 0.3% |
| 3008 | 14 | 0.1% |
| Value | Count | Frequency (%) |
| 3978 | 3 | < 0.1% |
| 3977 | 14 | |
| 3976 | 7 | |
| 3975 | 1 | < 0.1% |
| 3910 | 11 |
| Distinct | 13 |
|---|---|
| Distinct (%) | 0.1% |
| Missing | 4481 |
| Missing (%) | 19.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 2.951956362 |
|---|---|
| Minimum | 0 |
| Maximum | 30 |
| Zeros | 17 |
| Zeros (%) | 0.1% |
| Memory size | 184.1 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 1 |
| Q1 | 2 |
| median | 3 |
| Q3 | 4 |
| 95-th percentile | 5 |
| Maximum | 30 |
| Range | 30 |
| Interquartile range (IQR) | 2 |
Descriptive statistics
| Standard deviation | 0.9960317812 |
|---|---|
| Coefficient of variation (CV) | 0.3374141278 |
| Kurtosis | 33.60731924 |
| Mean | 2.951956362 |
| Median Absolute Deviation (MAD) | 1 |
| Skewness | 1.667158799 |
| Sum | 56282 |
| Variance | 0.9920793091 |
| Monotocity | Not monotonic |
| Value | Count | Frequency (%) |
| 3 | 8207 | |
| 2 | 5059 | |
| 4 | 3831 | |
| 1 | 942 | 4.0% |
| 5 | 873 | 3.7% |
| 6 | 104 | 0.4% |
| 7 | 17 | 0.1% |
| 0 | 17 | 0.1% |
| 8 | 8 | < 0.1% |
| 9 | 5 | < 0.1% |
| Other values (3) | 3 | < 0.1% |
| (Missing) | 4481 |
| Value | Count | Frequency (%) |
| 0 | 17 | 0.1% |
| 1 | 942 | 4.0% |
| 2 | 5059 | |
| 3 | 8207 | |
| 4 | 3831 |
| Value | Count | Frequency (%) |
| 30 | 1 | < 0.1% |
| 20 | 1 | < 0.1% |
| 10 | 1 | < 0.1% |
| 9 | 5 | |
| 8 | 8 |
| Distinct | 10 |
|---|---|
| Distinct (%) | 0.1% |
| Missing | 4484 |
| Missing (%) | 19.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 1.570896501 |
|---|---|
| Minimum | 0 |
| Maximum | 12 |
| Zeros | 46 |
| Zeros (%) | 0.2% |
| Memory size | 184.1 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 1 |
| Q1 | 1 |
| median | 1 |
| Q3 | 2 |
| 95-th percentile | 3 |
| Maximum | 12 |
| Range | 12 |
| Interquartile range (IQR) | 1 |
Descriptive statistics
| Standard deviation | 0.7126836776 |
|---|---|
| Coefficient of variation (CV) | 0.4536795881 |
| Kurtosis | 5.263207479 |
| Mean | 1.570896501 |
| Median Absolute Deviation (MAD) | 0 |
| Skewness | 1.433137743 |
| Sum | 29946 |
| Variance | 0.5079180243 |
| Monotocity | Not monotonic |
| Value | Count | Frequency (%) |
| 1 | 10080 | |
| 2 | 7273 | |
| 3 | 1422 | 6.0% |
| 4 | 180 | 0.8% |
| 5 | 51 | 0.2% |
| 0 | 46 | 0.2% |
| 6 | 5 | < 0.1% |
| 7 | 3 | < 0.1% |
| 8 | 2 | < 0.1% |
| 12 | 1 | < 0.1% |
| (Missing) | 4484 |
| Value | Count | Frequency (%) |
| 0 | 46 | 0.2% |
| 1 | 10080 | |
| 2 | 7273 | |
| 3 | 1422 | 6.0% |
| 4 | 180 | 0.8% |
| Value | Count | Frequency (%) |
| 12 | 1 | < 0.1% |
| 8 | 2 | < 0.1% |
| 7 | 3 | < 0.1% |
| 6 | 5 | < 0.1% |
| 5 | 51 |
| Distinct | 13 |
|---|---|
| Distinct (%) | 0.1% |
| Missing | 4626 |
| Missing (%) | 19.6% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 1.6262354 |
|---|---|
| Minimum | 0 |
| Maximum | 26 |
| Zeros | 1385 |
| Zeros (%) | 5.9% |
| Memory size | 184.1 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 1 |
| median | 2 |
| Q3 | 2 |
| 95-th percentile | 3 |
| Maximum | 26 |
| Range | 26 |
| Interquartile range (IQR) | 1 |
Descriptive statistics
| Standard deviation | 0.9740484799 |
|---|---|
| Coefficient of variation (CV) | 0.5989590929 |
| Kurtosis | 25.63523817 |
| Mean | 1.6262354 |
| Median Absolute Deviation (MAD) | 1 |
| Skewness | 2.136242257 |
| Sum | 30770 |
| Variance | 0.9487704412 |
| Monotocity | Not monotonic |
| Value | Count | Frequency (%) |
| 2 | 8088 | |
| 1 | 7513 | |
| 0 | 1385 | 5.9% |
| 3 | 1045 | 4.4% |
| 4 | 693 | 2.9% |
| 5 | 87 | 0.4% |
| 6 | 79 | 0.3% |
| 8 | 14 | 0.1% |
| 7 | 11 | < 0.1% |
| 10 | 3 | < 0.1% |
| Other values (3) | 3 | < 0.1% |
| (Missing) | 4626 |
| Value | Count | Frequency (%) |
| 0 | 1385 | 5.9% |
| 1 | 7513 | |
| 2 | 8088 | |
| 3 | 1045 | 4.4% |
| 4 | 693 | 2.9% |
| Value | Count | Frequency (%) |
| 26 | 1 | < 0.1% |
| 11 | 1 | < 0.1% |
| 10 | 3 | < 0.1% |
| 9 | 1 | < 0.1% |
| 8 | 14 |
| Distinct | 1567 |
|---|---|
| Distinct (%) | 9.0% |
| Missing | 6137 |
| Missing (%) | 26.1% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 551.7834578 |
|---|---|
| Minimum | 0 |
| Maximum | 433014 |
| Zeros | 2437 |
| Zeros (%) | 10.3% |
| Memory size | 184.1 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 181 |
| median | 448 |
| Q3 | 656 |
| 95-th percentile | 1005 |
| Maximum | 433014 |
| Range | 433014 |
| Interquartile range (IQR) | 475 |
Descriptive statistics
| Standard deviation | 3544.288014 |
|---|---|
| Coefficient of variation (CV) | 6.423331406 |
| Kurtosis | 12762.56697 |
| Mean | 551.7834578 |
| Median Absolute Deviation (MAD) | 238 |
| Skewness | 106.0673053 |
| Sum | 9606550 |
| Variance | 12561977.52 |
| Monotocity | Not monotonic |
| Value | Count | Frequency (%) |
| 0 | 2437 | 10.3% |
| 650 | 125 | 0.5% |
| 697 | 89 | 0.4% |
| 585 | 67 | 0.3% |
| 700 | 58 | 0.2% |
| 696 | 54 | 0.2% |
| 590 | 50 | 0.2% |
| 534 | 49 | 0.2% |
| 600 | 46 | 0.2% |
| 604 | 46 | 0.2% |
| Other values (1557) | 14389 | |
| (Missing) | 6137 |
| Value | Count | Frequency (%) |
| 0 | 2437 | |
| 1 | 3 | < 0.1% |
| 2 | 1 | < 0.1% |
| 3 | 2 | < 0.1% |
| 5 | 1 | < 0.1% |
| Value | Count | Frequency (%) |
| 433014 | 1 | |
| 76000 | 1 | |
| 75100 | 1 | |
| 44500 | 1 | |
| 41400 | 1 |
| Distinct | 688 |
|---|---|
| Distinct (%) | 6.9% |
| Missing | 13529 |
| Missing (%) | 57.5% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 154.5278952 |
|---|---|
| Minimum | 0 |
| Maximum | 44515 |
| Zeros | 30 |
| Zeros (%) | 0.1% |
| Memory size | 184.1 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 52 |
| Q1 | 95 |
| median | 129 |
| Q3 | 180 |
| 95-th percentile | 305 |
| Maximum | 44515 |
| Range | 44515 |
| Interquartile range (IQR) | 85 |
Descriptive statistics
| Standard deviation | 462.5357653 |
|---|---|
| Coefficient of variation (CV) | 2.99321857 |
| Kurtosis | 8454.277997 |
| Mean | 154.5278952 |
| Median Absolute Deviation (MAD) | 40 |
| Skewness | 88.59840308 |
| Sum | 1548060.454 |
| Variance | 213939.3342 |
| Monotocity | Not monotonic |
| Value | Count | Frequency (%) |
| 120 | 144 | 0.6% |
| 100 | 129 | 0.5% |
| 110 | 123 | 0.5% |
| 130 | 117 | 0.5% |
| 115 | 102 | 0.4% |
| 150 | 96 | 0.4% |
| 125 | 92 | 0.4% |
| 112 | 89 | 0.4% |
| 80 | 89 | 0.4% |
| 140 | 85 | 0.4% |
| Other values (678) | 8952 | |
| (Missing) | 13529 |
| Value | Count | Frequency (%) |
| 0 | 30 | |
| 0.01 | 1 | < 0.1% |
| 1 | 13 | |
| 2 | 18 | |
| 3 | 23 |
| Value | Count | Frequency (%) |
| 44515 | 1 | |
| 6791 | 1 | |
| 4645 | 1 | |
| 3647 | 1 | |
| 3558 | 1 |
| Distinct | 155 |
|---|---|
| Distinct (%) | 1.3% |
| Missing | 12007 |
| Missing (%) | 51.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 1964.636742 |
|---|---|
| Minimum | 1196 |
| Maximum | 2106 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Memory size | 184.1 KiB |
Quantile statistics
| Minimum | 1196 |
|---|---|
| 5-th percentile | 1900 |
| Q1 | 1940 |
| median | 1970 |
| Q3 | 2000 |
| 95-th percentile | 2012 |
| Maximum | 2106 |
| Range | 910 |
| Interquartile range (IQR) | 60 |
Descriptive statistics
| Standard deviation | 37.59550363 |
|---|---|
| Coefficient of variation (CV) | 0.0191361094 |
| Kurtosis | 14.37586387 |
| Mean | 1964.636742 |
| Median Absolute Deviation (MAD) | 30 |
| Skewness | -1.228668418 |
| Sum | 22671908 |
| Variance | 1413.421893 |
| Monotocity | Not monotonic |
| Value | Count | Frequency (%) |
| 1970 | 1178 | 5.0% |
| 1960 | 971 | 4.1% |
| 1950 | 787 | 3.3% |
| 1900 | 480 | 2.0% |
| 1980 | 470 | 2.0% |
| 2000 | 449 | 1.9% |
| 1930 | 406 | 1.7% |
| 1920 | 405 | 1.7% |
| 1890 | 345 | 1.5% |
| 1910 | 330 | 1.4% |
| Other values (145) | 5719 | |
| (Missing) | 12007 |
| Value | Count | Frequency (%) |
| 1196 | 1 | < 0.1% |
| 1800 | 1 | < 0.1% |
| 1830 | 1 | < 0.1% |
| 1850 | 4 | |
| 1854 | 2 |
| Value | Count | Frequency (%) |
| 2106 | 1 | < 0.1% |
| 2018 | 1 | < 0.1% |
| 2017 | 24 | 0.1% |
| 2016 | 77 | |
| 2015 | 98 |
| Distinct | 34 |
|---|---|
| Distinct (%) | 0.2% |
| Missing | 7891 |
| Missing (%) | 33.5% |
| Memory size | 184.1 KiB |
| Boroondara | |
|---|---|
| Moreland | |
| Stonnington | |
| Moonee Valley | |
| Darebin | |
| Other values (29) |
Length
| Max length | 17 |
|---|---|
| Median length | 9 |
| Mean length | 9.085781809 |
| Min length | 4 |
Characters and Unicode
| Total characters | 142247 |
|---|---|
| Distinct characters | 39 |
| Distinct categories | 3 ? |
| Distinct scripts | 2 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 1 ? |
|---|---|
| Unique (%) | < 0.1% |
Sample
| 1st row | Yarra |
|---|---|
| 2nd row | Yarra |
| 3rd row | Yarra |
| 4th row | Yarra |
| 5th row | Yarra |
| Value | Count | Frequency (%) |
| Boroondara | 1677 | 7.1% |
| Moreland | 1421 | 6.0% |
| Stonnington | 1141 | 4.8% |
| Moonee Valley | 1141 | 4.8% |
| Darebin | 1113 | 4.7% |
| Glen Eira | 1019 | 4.3% |
| Port Phillip | 849 | 3.6% |
| Yarra | 836 | 3.6% |
| Maribyrnong | 836 | 3.6% |
| Banyule | 763 | 3.2% |
| Other values (24) | 4860 | |
| (Missing) | 7891 |
| Value | Count | Frequency (%) |
| boroondara | 1677 | 8.7% |
| moreland | 1421 | 7.4% |
| moonee | 1141 | 5.9% |
| valley | 1141 | 5.9% |
| stonnington | 1141 | 5.9% |
| darebin | 1113 | 5.8% |
| eira | 1019 | 5.3% |
| glen | 1019 | 5.3% |
| yarra | 863 | 4.5% |
| phillip | 849 | 4.4% |
| Other values (29) | 7900 |
Most occurring characters
| Value | Count | Frequency (%) |
| n | 17856 | |
| o | 15986 | |
| a | 15155 | 10.7% |
| r | 13007 | 9.1% |
| e | 11445 | 8.0% |
| i | 8321 | 5.8% |
| l | 8129 | 5.7% |
| M | 5022 | 3.5% |
| t | 4415 | 3.1% |
| d | 4114 | 2.9% |
| Other values (29) | 38797 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 119335 | |
| Uppercase Letter | 19284 | 13.6% |
| Space Separator | 3628 | 2.6% |
Most frequent character per category
| Value | Count | Frequency (%) |
| n | 17856 | |
| o | 15986 | |
| a | 15155 | |
| r | 13007 | |
| e | 11445 | |
| i | 8321 | |
| l | 8129 | |
| t | 4415 | 3.7% |
| d | 4114 | 3.4% |
| y | 4099 | 3.4% |
| Other values (11) | 16808 |
| Value | Count | Frequency (%) |
| M | 5022 | |
| B | 4112 | |
| P | 1698 | 8.8% |
| D | 1191 | 6.2% |
| V | 1141 | 5.9% |
| S | 1141 | 5.9% |
| G | 1097 | 5.7% |
| E | 1019 | 5.3% |
| Y | 863 | 4.5% |
| W | 756 | 3.9% |
| Other values (7) | 1244 | 6.5% |
| Value | Count | Frequency (%) |
| 3628 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 138619 | |
| Common | 3628 | 2.6% |
Most frequent character per script
| Value | Count | Frequency (%) |
| n | 17856 | |
| o | 15986 | |
| a | 15155 | |
| r | 13007 | 9.4% |
| e | 11445 | 8.3% |
| i | 8321 | 6.0% |
| l | 8129 | 5.9% |
| M | 5022 | 3.6% |
| t | 4415 | 3.2% |
| d | 4114 | 3.0% |
| Other values (28) | 35169 |
| Value | Count | Frequency (%) |
| 3628 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 142247 |
Most frequent character per block
| Value | Count | Frequency (%) |
| n | 17856 | |
| o | 15986 | |
| a | 15155 | 10.7% |
| r | 13007 | 9.1% |
| e | 11445 | 8.0% |
| i | 8321 | 5.8% |
| l | 8129 | 5.7% |
| M | 5022 | 3.5% |
| t | 4415 | 3.1% |
| d | 4114 | 2.9% |
| Other values (29) | 38797 |
| Distinct | 8837 |
|---|---|
| Distinct (%) | 45.9% |
| Missing | 4304 |
| Missing (%) | 18.3% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | -37.81243432 |
|---|---|
| Minimum | -38.18418 |
| Maximum | -37.40758 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Memory size | 184.1 KiB |
Quantile statistics
| Minimum | -38.18418 |
|---|---|
| 5-th percentile | -37.935737 |
| Q1 | -37.8593 |
| median | -37.8097 |
| Q3 | -37.7598 |
| 95-th percentile | -37.6987 |
| Maximum | -37.40758 |
| Range | 0.7766 |
| Interquartile range (IQR) | 0.0995 |
Descriptive statistics
| Standard deviation | 0.07992583541 |
|---|---|
| Coefficient of variation (CV) | -0.002113744773 |
| Kurtosis | 1.705319878 |
| Mean | -37.81243432 |
| Median Absolute Deviation (MAD) | 0.0497 |
| Skewness | -0.3165347554 |
| Sum | -727624.6735 |
| Variance | 0.006388139166 |
| Monotocity | Not monotonic |
| Value | Count | Frequency (%) |
| -37.8361 | 25 | 0.1% |
| -37.8424 | 22 | 0.1% |
| -37.8198 | 20 | 0.1% |
| -37.7956 | 17 | 0.1% |
| -37.8414 | 17 | 0.1% |
| -37.7969 | 17 | 0.1% |
| -37.8161 | 16 | 0.1% |
| -37.847 | 16 | 0.1% |
| -37.851 | 16 | 0.1% |
| -37.7634 | 16 | 0.1% |
| Other values (8827) | 19061 | |
| (Missing) | 4304 | 18.3% |
| Value | Count | Frequency (%) |
| -38.18418 | 1 | |
| -38.18255 | 1 | |
| -38.18163 | 1 | |
| -38.17829 | 1 | |
| -38.17745 | 1 |
| Value | Count | Frequency (%) |
| -37.40758 | 1 | |
| -37.40853 | 1 | |
| -37.41318 | 1 | |
| -37.41381 | 1 | |
| -37.41495 | 1 |
| Distinct | 9584 |
|---|---|
| Distinct (%) | 49.8% |
| Missing | 4304 |
| Missing (%) | 18.3% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 145.0002867 |
|---|---|
| Minimum | 144.43162 |
| Maximum | 145.52635 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Memory size | 184.1 KiB |
Quantile statistics
| Minimum | 144.43162 |
|---|---|
| 5-th percentile | 144.833765 |
| Q1 | 144.9393 |
| median | 145.0043 |
| Q3 | 145.0631 |
| 95-th percentile | 145.164607 |
| Maximum | 145.52635 |
| Range | 1.09473 |
| Interquartile range (IQR) | 0.1238 |
Descriptive statistics
| Standard deviation | 0.106070626 |
|---|---|
| Coefficient of variation (CV) | 0.0007315201127 |
| Kurtosis | 1.904059951 |
| Mean | 145.0002867 |
| Median Absolute Deviation (MAD) | 0.06159 |
| Skewness | -0.3123235122 |
| Sum | 2790240.516 |
| Variance | 0.01125097771 |
| Monotocity | Not monotonic |
| Value | Count | Frequency (%) |
| 144.9966 | 20 | 0.1% |
| 144.985 | 16 | 0.1% |
| 145.0104 | 16 | 0.1% |
| 145.0001 | 16 | 0.1% |
| 144.991 | 16 | 0.1% |
| 145.0243 | 16 | 0.1% |
| 144.9911 | 15 | 0.1% |
| 144.9679 | 15 | 0.1% |
| 144.997 | 15 | 0.1% |
| 145.0116 | 14 | 0.1% |
| Other values (9574) | 19084 | |
| (Missing) | 4304 | 18.3% |
| Value | Count | Frequency (%) |
| 144.43162 | 1 | |
| 144.43181 | 1 | |
| 144.48571 | 1 | |
| 144.54022 | 1 | |
| 144.54237 | 1 |
| Value | Count | Frequency (%) |
| 145.52635 | 1 | |
| 145.51137 | 1 | |
| 145.48273 | 1 | |
| 145.47052 | 1 | |
| 145.46271 | 1 |
Regionname
Categorical
| Distinct | 8 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 1 |
| Missing (%) | < 0.1% |
| Memory size | 184.1 KiB |
| Southern Metropolitan | |
|---|---|
| Northern Metropolitan | |
| Western Metropolitan | |
| Eastern Metropolitan | |
| South-Eastern Metropolitan | 857 |
| Other values (3) | 236 |
Length
| Max length | 26 |
|---|---|
| Median length | 21 |
| Mean length | 20.8293553 |
| Min length | 16 |
Characters and Unicode
| Total characters | 490448 |
|---|---|
| Distinct characters | 21 |
| Distinct categories | 4 ? |
| Distinct scripts | 2 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | Northern Metropolitan |
|---|---|
| 2nd row | Northern Metropolitan |
| 3rd row | Northern Metropolitan |
| 4th row | Northern Metropolitan |
| 5th row | Northern Metropolitan |
| Value | Count | Frequency (%) |
| Southern Metropolitan | 8772 | |
| Northern Metropolitan | 6480 | |
| Western Metropolitan | 4561 | |
| Eastern Metropolitan | 2640 | 11.2% |
| South-Eastern Metropolitan | 857 | 3.6% |
| Eastern Victoria | 107 | 0.5% |
| Northern Victoria | 78 | 0.3% |
| Western Victoria | 51 | 0.2% |
| (Missing) | 1 | < 0.1% |
| Value | Count | Frequency (%) |
| metropolitan | 23310 | |
| southern | 8772 | 18.6% |
| northern | 6558 | 13.9% |
| western | 4612 | 9.8% |
| eastern | 2747 | 5.8% |
| south-eastern | 857 | 1.8% |
| victoria | 236 | 0.5% |
Most occurring characters
| Value | Count | Frequency (%) |
| t | 71259 | |
| o | 63043 | |
| r | 53650 | |
| e | 51468 | |
| n | 46856 | |
| a | 27150 | 5.5% |
| i | 23782 | 4.8% |
| 23546 | 4.8% | |
| M | 23310 | 4.8% |
| p | 23310 | 4.8% |
| Other values (11) | 83074 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 418096 | |
| Uppercase Letter | 47949 | 9.8% |
| Space Separator | 23546 | 4.8% |
| Dash Punctuation | 857 | 0.2% |
Most frequent character per category
| Value | Count | Frequency (%) |
| t | 71259 | |
| o | 63043 | |
| r | 53650 | |
| e | 51468 | |
| n | 46856 | |
| a | 27150 | 6.5% |
| i | 23782 | 5.7% |
| p | 23310 | 5.6% |
| l | 23310 | 5.6% |
| h | 16187 | 3.9% |
| Other values (3) | 18081 | 4.3% |
| Value | Count | Frequency (%) |
| M | 23310 | |
| S | 9629 | |
| N | 6558 | 13.7% |
| W | 4612 | 9.6% |
| E | 3604 | 7.5% |
| V | 236 | 0.5% |
| Value | Count | Frequency (%) |
| 23546 |
| Value | Count | Frequency (%) |
| - | 857 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 466045 | |
| Common | 24403 | 5.0% |
Most frequent character per script
| Value | Count | Frequency (%) |
| t | 71259 | |
| o | 63043 | |
| r | 53650 | |
| e | 51468 | |
| n | 46856 | |
| a | 27150 | 5.8% |
| i | 23782 | 5.1% |
| M | 23310 | 5.0% |
| p | 23310 | 5.0% |
| l | 23310 | 5.0% |
| Other values (9) | 58907 |
| Value | Count | Frequency (%) |
| 23546 | ||
| - | 857 | 3.5% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 490448 |
Most frequent character per block
| Value | Count | Frequency (%) |
| t | 71259 | |
| o | 63043 | |
| r | 53650 | |
| e | 51468 | |
| n | 46856 | |
| a | 27150 | 5.5% |
| i | 23782 | 4.8% |
| 23546 | 4.8% | |
| M | 23310 | 4.8% |
| p | 23310 | 4.8% |
| Other values (11) | 83074 |
Propertycount
Real number (ℝ≥0)
| Distinct | 330 |
|---|---|
| Distinct (%) | 1.4% |
| Missing | 1 |
| Missing (%) | < 0.1% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 7517.480591 |
|---|---|
| Minimum | 129 |
| Maximum | 21650 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Memory size | 184.1 KiB |
Quantile statistics
| Minimum | 129 |
|---|---|
| 5-th percentile | 2185 |
| Q1 | 4385 |
| median | 6567 |
| Q3 | 10331 |
| 95-th percentile | 15321 |
| Maximum | 21650 |
| Range | 21521 |
| Interquartile range (IQR) | 5946 |
Descriptive statistics
| Standard deviation | 4414.995634 |
|---|---|
| Coefficient of variation (CV) | 0.5872972441 |
| Kurtosis | 1.138227076 |
| Mean | 7517.480591 |
| Median Absolute Deviation (MAD) | 2694 |
| Skewness | 1.056708738 |
| Sum | 177006598 |
| Variance | 19492186.45 |
| Monotocity | Not monotonic |
| Value | Count | Frequency (%) |
| 21650 | 629 | 2.7% |
| 8870 | 524 | 2.2% |
| 10969 | 429 | 1.8% |
| 14949 | 416 | 1.8% |
| 10412 | 378 | 1.6% |
| 10331 | 357 | 1.5% |
| 14577 | 357 | 1.5% |
| 10579 | 348 | 1.5% |
| 14887 | 331 | 1.4% |
| 11918 | 330 | 1.4% |
| Other values (320) | 19447 |
| Value | Count | Frequency (%) |
| 129 | 1 | < 0.1% |
| 249 | 1 | < 0.1% |
| 389 | 11 | |
| 394 | 16 | |
| 438 | 9 |
| Value | Count | Frequency (%) |
| 21650 | 629 | |
| 17496 | 159 | 0.7% |
| 17384 | 9 | < 0.1% |
| 17093 | 21 | 0.1% |
| 17055 | 47 | 0.2% |
Pearson's r
The Pearson's correlation coefficient (r) is a measure of linear correlation between two variables. It's value lies between -1 and +1, -1 indicating total negative linear correlation, 0 indicating no linear correlation and 1 indicating total positive linear correlation. Furthermore, r is invariant under separate changes in location and scale of the two variables, implying that for a linear function the angle to the x-axis does not affect r.To calculate r for two variables X and Y, one divides the covariance of X and Y by the product of their standard deviations.
Spearman's ρ
The Spearman's rank correlation coefficient (ρ) is a measure of monotonic correlation between two variables, and is therefore better in catching nonlinear monotonic correlations than Pearson's r. It's value lies between -1 and +1, -1 indicating total negative monotonic correlation, 0 indicating no monotonic correlation and 1 indicating total positive monotonic correlation.To calculate ρ for two variables X and Y, one divides the covariance of the rank variables of X and Y by the product of their standard deviations.
Kendall's τ
Similarly to Spearman's rank correlation coefficient, the Kendall rank correlation coefficient (τ) measures ordinal association between two variables. It's value lies between -1 and +1, -1 indicating total negative correlation, 0 indicating no correlation and 1 indicating total positive correlation.To calculate τ for two variables X and Y, one determines the number of concordant and discordant pairs of observations. τ is given by the number of concordant pairs minus the discordant pairs divided by the total number of pairs.
Phik (φk)
Phik (φk) is a new and practical correlation coefficient that works consistently between categorical, ordinal and interval variables, captures non-linear dependency and reverts to the Pearson correlation coefficient in case of a bivariate normal input distribution. There is extensive documentation available here.Cramér's V (φc)
Cramér's V is an association measure for nominal random variables. The coefficient ranges from 0 to 1, with 0 indicating independence and 1 indicating perfect association. The empirical estimators used for Cramér's V have been proved to be biased, even for large samples. We use a bias-corrected measure that has been proposed by Bergsma in 2013 that can be found here.First rows
| Suburb | Address | Rooms | Type | Price | Method | SellerG | Date | Distance | Postcode | Bedroom2 | Bathroom | Car | Landsize | BuildingArea | YearBuilt | CouncilArea | Lattitude | Longtitude | Regionname | Propertycount | |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 0 | Abbotsford | 68 Studley St | 2 | h | NaN | SS | Jellis | 03-09-2016 | 2.5 | 3067.0 | 2.0 | 1.0 | 1.0 | 126.0 | NaN | NaN | Yarra | -37.8014 | 144.9958 | Northern Metropolitan | 4019.0 |
| 1 | Abbotsford | 85 Turner St | 2 | h | 1480000.0 | S | Biggin | 03-12-2016 | 2.5 | 3067.0 | 2.0 | 1.0 | 1.0 | 202.0 | NaN | NaN | Yarra | -37.7996 | 144.9984 | Northern Metropolitan | 4019.0 |
| 2 | Abbotsford | 25 Bloomburg St | 2 | h | 1035000.0 | S | Biggin | 04-02-2016 | 2.5 | 3067.0 | 2.0 | 1.0 | 0.0 | 156.0 | 79.0 | 1900.0 | Yarra | -37.8079 | 144.9934 | Northern Metropolitan | 4019.0 |
| 3 | Abbotsford | 18/659 Victoria St | 3 | u | NaN | VB | Rounds | 04-02-2016 | 2.5 | 3067.0 | 3.0 | 2.0 | 1.0 | 0.0 | NaN | NaN | Yarra | -37.8114 | 145.0116 | Northern Metropolitan | 4019.0 |
| 4 | Abbotsford | 5 Charles St | 3 | h | 1465000.0 | SP | Biggin | 04-03-2017 | 2.5 | 3067.0 | 3.0 | 2.0 | 0.0 | 134.0 | 150.0 | 1900.0 | Yarra | -37.8093 | 144.9944 | Northern Metropolitan | 4019.0 |
| 5 | Abbotsford | 40 Federation La | 3 | h | 850000.0 | PI | Biggin | 04-03-2017 | 2.5 | 3067.0 | 3.0 | 2.0 | 1.0 | 94.0 | NaN | NaN | Yarra | -37.7969 | 144.9969 | Northern Metropolitan | 4019.0 |
| 6 | Abbotsford | 55a Park St | 4 | h | 1600000.0 | VB | Nelson | 04-06-2016 | 2.5 | 3067.0 | 3.0 | 1.0 | 2.0 | 120.0 | 142.0 | 2014.0 | Yarra | -37.8072 | 144.9941 | Northern Metropolitan | 4019.0 |
| 7 | Abbotsford | 16 Maugie St | 4 | h | NaN | SN | Nelson | 06-08-2016 | 2.5 | 3067.0 | 3.0 | 2.0 | 2.0 | 400.0 | 220.0 | 2006.0 | Yarra | -37.7965 | 144.9965 | Northern Metropolitan | 4019.0 |
| 8 | Abbotsford | 53 Turner St | 2 | h | NaN | S | Biggin | 06-08-2016 | 2.5 | 3067.0 | 4.0 | 1.0 | 2.0 | 201.0 | NaN | 1900.0 | Yarra | -37.7995 | 144.9974 | Northern Metropolitan | 4019.0 |
| 9 | Abbotsford | 99 Turner St | 2 | h | NaN | S | Collins | 06-08-2016 | 2.5 | 3067.0 | 3.0 | 2.0 | 1.0 | 202.0 | NaN | 1900.0 | Yarra | -37.7996 | 144.9989 | Northern Metropolitan | 4019.0 |
Last rows
| Suburb | Address | Rooms | Type | Price | Method | SellerG | Date | Distance | Postcode | Bedroom2 | Bathroom | Car | Landsize | BuildingArea | YearBuilt | CouncilArea | Lattitude | Longtitude | Regionname | Propertycount | |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 23537 | Wheelers Hill | 12 Strada Cr | 4 | h | 1245000.0 | S | Barry | 26-08-2017 | 16.7 | 3150.0 | 4.0 | 2.0 | 2.0 | 652.0 | NaN | 1981.0 | NaN | -37.90562 | 145.16761 | South-Eastern Metropolitan | 7392.0 |
| 23538 | Williamstown | 77 Merrett Dr | 3 | h | 1031000.0 | SP | Williams | 26-08-2017 | 6.8 | 3016.0 | 3.0 | 2.0 | 2.0 | 333.0 | 133.0 | 1995.0 | NaN | -37.85927 | 144.87904 | Western Metropolitan | 6380.0 |
| 23539 | Williamstown | 83 Power St | 3 | h | 1170000.0 | S | Raine | 26-08-2017 | 6.8 | 3016.0 | 3.0 | 2.0 | 4.0 | 436.0 | NaN | 1997.0 | NaN | -37.85274 | 144.88738 | Western Metropolitan | 6380.0 |
| 23540 | Williamstown | 8/2 Thompson St | 2 | t | 622500.0 | SP | Greg | 26-08-2017 | 6.8 | 3016.0 | 2.0 | 2.0 | 1.0 | NaN | 89.0 | 2010.0 | NaN | -37.86393 | 144.90484 | Western Metropolitan | 6380.0 |
| 23541 | Williamstown | 96 Verdon St | 4 | h | 2500000.0 | PI | Sweeney | 26-08-2017 | 6.8 | 3016.0 | 4.0 | 1.0 | 5.0 | 866.0 | 157.0 | 1920.0 | NaN | -37.85908 | 144.89299 | Western Metropolitan | 6380.0 |
| 23542 | Wyndham Vale | 25 Clitheroe Dr | 3 | u | NaN | PN | Harcourts | 26-08-2017 | 27.2 | 3024.0 | 3.0 | 1.0 | 0.0 | 552.0 | 119.0 | 1990.0 | NaN | -37.90032 | 144.61839 | Western Metropolitan | 5262.0 |
| 23543 | Wyndham Vale | 19 Dalrymple Bvd | 4 | h | NaN | S | hockingstuart | 26-08-2017 | 27.2 | 3024.0 | NaN | NaN | NaN | NaN | NaN | NaN | NaN | -37.87882 | 144.60184 | Western Metropolitan | 5262.0 |
| 23544 | Yallambie | 17 Amaroo Wy | 4 | h | 1100000.0 | S | Buckingham | 26-08-2017 | 12.7 | 3085.0 | 4.0 | 3.0 | 2.0 | NaN | NaN | NaN | NaN | -37.72006 | 145.10547 | Northern Metropolitan | 1369.0 |
| 23545 | Yarraville | 6 Agnes St | 4 | h | 1285000.0 | SP | Village | 26-08-2017 | 6.3 | 3013.0 | 4.0 | 1.0 | 1.0 | 362.0 | 112.0 | 1920.0 | NaN | -37.81188 | 144.88449 | Western Metropolitan | 6543.0 |
| 23546 | Yarraville | 33 Freeman St | 4 | h | 1050000.0 | VB | Village | 26-08-2017 | 6.3 | 3013.0 | 4.0 | 2.0 | 2.0 | NaN | 139.0 | 1950.0 | NaN | -37.81829 | 144.87404 | Western Metropolitan | 6543.0 |